General properties of Nonsignaling Theories 
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This article identifies a series of properties common to all theories that do not allow for super- 
luminal signaling and predict the violation of Bell inequalities. Intrinsic randomness, uncertainty 
due to the incompatibility of two observables, monogamy of correlations, impossibility of perfect 
cloning, privacy of correlations, bounds in the shareability of some states; all these phenomena are 
solely a consequence of the no-signaling principle and nonlocality. In particular, it is shown that for 
any distribution, the properties of (i) nonlocal, (ii) no arbitrarily shareable and (iii) positive secrecy 
content are equivalent. 

PACS numbers: 



I. INTRODUCTION 

There are two experimental facts that, when consid- 
ered together, significantly restrict any possible physi- 
cal theory that aims at accounting for them. The first 
one is the constancy of the speed of light in any refer- 
ence frame. This implies that no signal carrying infor- 
mation can propagate faster than light. More generally, 
we refer as the no-signaling principle the impossibil- 
ity of sending information arbitrarily fast. The second 
fact, is the existence of correlations between space-like 
separated events that violate Bell inequaHties [1, 2]. 
This means that such correlations cannot be explained 
by strategies arranged in the past. Models accounting 
for such correlations can be constructed by assuming 
some signaling between the correlated events. But this 
seems to contradict the first experimental fact. This 
is the reason why such correlations are called nonlo- 
cal. Despite this, physical theories exist that predict 
the violation of Bell inequalities and are nonsignaling, 
an example being Quantum Mechanics (QM). 

QM is not the unique theory consistent with the two 
mentioned experimental facts. It is well known that 
there exist nonsignaling correlations that are more non- 
local than the ones predicted by QM. Indeed, Popcscu 
and Rohrlich proved that there are nonsignaling cor- 
relations giving a Bell inequality violation larger than 
the quantum mechanical prediction [3]. This suggests 
the possible existence of theories, different from QM, 
that allow for Bell inequality violation without contra- 
dicting the no-signaling principle. Although there is 
no experimental reason to reject QM, it is highly de- 
sirable to know the nature of these alternative theories 
in order to " study quantum physics form the outside" . 
In this article, we aim at providing a unified picture 
for the static part [we do not consider dynamics] of all 
such theories, identifying a series of features common 
to all of them. 

Analyzing these common properties can be very use- 
ful in gaining a better understanding of QM. It is often 
said that the postulates of QM do not have a clear 



physical meaning, especially when compared with the 
postulates of other theories, like Relativity or Thermo- 
dynamics. The postulates of QM imply no-signaling [if 
we assume locality of interactions] , and nonlocality. It 
was proposed by Popescu and Rohrlich to consider no- 
signaling and the existence of nonlocal correlations as 
proper physical principles. Could these two principles, 
together with other independent postulates imply QM? 
What would these other postulates look like? For such 
an enterprise, it is very important to learn all the con- 
sequences that follow from these two principles without 
any extra assumption. 

From an information-theoretical point of view, it is 
also worth looking at a framework more general than 
QM, as illustrated by several recent works analyzing 
the use of nonlocal correlations as an information- 
theoretical resource [4, 5]. This is of particular interest 
in the case of secret communication: there, the secu- 
rity of a protocol relies on some assumptions on the 
eavesdropper capabilities. Usually, it is assumed that 
her computational power is bounded, or that her ac- 
tion is constrained by QM laws. It is then desirable to 
weaken the strength of these assumptions as much as 
possible. In this sense, a secret key distribution was 
recently proposed in [6] and its security proved solely 
using the no-signaling principle. In this article, we ex- 
tend the connection between nonlocality and secrecy at 
the level of an equivalence. Notice that the fact that a 
probability distribution contains secrecy does not im- 
ply that it can be distilled into a secret key (see below) . 



A. Summary and results 

The article is organized as follows: in section 2 
nonsignaling correlations are introduced, local and 
nonlocal ones are distinguished. Special emphasis is 
made on a particular family of distributions that we 
call isotropic, which will prove very useful in later rea- 
sonings. 

In section 3, different aspects of monogamy in nonlo- 
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cal correlations arc presented. In particular, the com- 
plete equivalence between locality and infinite share- 
ability is proven (section 3.1). In section 3.2, through 
some examples, we survey the complex structure of the 
monogamy relations. 

In section 4 we prove that, any nonsignaling theory 
that predicts the violation of at least one Bell inequality 
has a No-Cloning Theorem. Some additional analysis 
is made for the case of QM. 

In section 5 wc prove that, nonsignaling correlations 
contain secrecy (in the sense of cost) if and only if they 
are nonlocal. 

In section 6 wc review the fact that all nonlocal cor- 
relations must have nondeterministic outcomes. And, 
in section 6.1 we show that, the more incompatible two 
obscrvablcs are, the more uncertain their outcomes. 

Finally, we conclude with some final remarks, expos- 
ing some open question. Some additional material and 
proofs is contained in the appendixes. 



II. DEFINITIONS AND GENERAL FRAME 

Consider n-parties — Alice, Bob, Clare. . . each 
possessing a physical system, which can be measured 
with different observables. Denote by Xk the observ- 
able chosen by party k, and by au the corresponding 
measurement outcome . The joint probability distribu- 
tion for the outcomes, conditioned on the observables 
chosen by the n parties is 

P(ai,...,o„|xi,...,a;„). (1) 

One can formulate this scenario in an equivalent and 
slightly more abstract way. Imagine that each of the 
n parties has a physical device with an input and an 
output. Just after the /e*'^ party inputs Xk, the device 
outputs Ofe, and it cannot be used anymore. Through- 
out this article, we assume that inputs and outputs 
take values from finite, but arbitrarily large, alphabets: 
Xk e {0, 1, . . . , Xfc - 1} and Ofc e {0, 1, . . . , A/t - 1}. 
Notice that, without loss of generality, we assume that 
all observables belonging to one party have the same 
number of outcomes. 

It is useful to look at these conditioned probability 
distributions (1) as points in a large dimensional space. 
The set of all these points (1) is a convex polytope. 
Unless no other constraints are imposed, (1) can be any 
vector of positive numbers, satisfying the normalization 
conditions 

P(ai,...a„|xi,...a;„) = 1 (2) 
for all input values Xi, . . .x„. 



A. Nonsignaling correlations 

The n-partite distribution P(oi, . . . a„|a;i, . . . a;„) is 
nonsignaling, when the marginal distribution for each 
subset of parties {ak^ , ■ ■ ■ afe„ } only depends on its cor- 
responding inputs 

P{aki,. ..ak^\xi, ...Xn) = P(afci, • ..ak^\xki,. .-Xk^). 

(3) 

It turns out that very few of these conditions are lin- 
early independent. It was proved in [4], that all condi- 
tions of the form (3) can be derived from the following 
Condition: For each k & {1, . . .n} the marginal dis- 
tribution obtained when tracing out Uk is independent 

of Xk'- 

^ P(ai, . . . ttfe . . . an\xi, ...Xk...Xn) (4) 

Ok 

= Y P{ai, . . . ttk . . . an\xi, . . . x'k . . . Xn), 
for all values of ai, . . .ak-i,ak+i ■ ■ ■Q'm and 

Xi. . . . Xk , Xj., Xk+l ■ ■ ■ Xn- 

These linear constraints characterize an affine set. 
The intersection of this set with the polytope of distri- 
butions (1) gives another convex polytope. Through- 
out this article, whenever we refer to distributions, cor- 
relations, states or points, we always assume they be- 
long to the nonsignaling poljdiope. 

B. Local correlations 

Local correlations are the ones that can be gener- 
ated if the parties share classical information, or equiv- 
alently, the ones that can be written as 

P{ai,...an\xi,...Xn) (5) 
= X] ^(e)f'(ai|a:i, e) • • • P(a„|a;„, e). 

€ 

This subset of correlations is a convex polytope delim- 
ited by two kinds of facets. The first kind warrants 
that all the components of (5) are positive, and thus, 
it is not interesting. Actually, they are already facets 
of the nonsignaling (and also of the more general) poly- 
tope. The second kind are the Bell inequalities, which 
can be violated by nonlocal correlations. Throughout 
this article we assume that all Bell inequalities have 
been normalized [with a transformation of the form 
B aB + P, where a and /3 are real numbers ] , such 
that the local bound is S[Plocal] < 0, and the maximal 
violation compatible with no-signaling is ^BIPmax] = 1- 
As said above, local correlations can be generated 
with shared randomness and local operations. In ex- 
pression (5), the random variable e stands for the in- 
formation shared among the parties, sometimes called 
local hidden variable. Depending on its value, the k^^ 
party locally generates P{ak\xk, e). The distributions 
that cannot be written like (5) are called nonlocal. 
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C. Quantum correlations 

We call quantum those correlations that can be gen- 
erated if the parties share quantum information [entan- 
glement] , or equivalently, those correlations that can be 
written as 



P(ai, . ..an\xi, ...x„)=tr 



ai 



p 



(6) 

where p is a quantum state, namely a unit-trace, semi- 
definite positive matrix, and {F^'^''\ . . .F^^^}_^} define 
what is called a positive operator valued measure [8]. 

That is, a set of positive operators {F^^''^} satisfying 



1, \/Xk. 



D. Isotropic correlations 



Let us define a particular family of bipartite distri- 
butions with binary input/output. In the case where 
the marginal distributions for a and b are unbiased, all 
the information of P{a,b\x,y) is contained in the four 
correlation functions: 



P{0,0\x,y)+P{l,l\x,y) (7) 
P{0,l\x,y)-P{l,0\x,y), 



for xy — 00,01,10,11. One can always fix 
Coo , Cqi 1 C'lo > by performing local reversible trans- 
formations. Once we have a distribution in this canon- 
ical form, its nonlocality is decided by the CHSH in- 
equality [7] in standard form: 



ScHSH — 2 [Coo 



01 



Cio — Cii] — 1. 



(8) 



We call isotropic, denoted by Piso(a, b\x, y), those cor- 
relations with unbiased marginal distributions for a 
and b that satisfy 



Co 



Coi — Ci 



-Cn > 0. 



(9) 



This family depends on a unique parameter C — Cqo, 
whose relation to the CHSH violation is 



^chsh[-Piso] — 2C 1. 



(10) 



In figure 1 we can see for which values of C the dis- 
tribution Piso belongs to the local and quantum set. 
When C = 1, this distribution is known as PR-box 
[3, 4], and is usually written as 



PpR(a, 6|a;,y) 



1/2 if a + bmod 2 ■ 
otherwise 



xy 



(11) 



This distribution can be considered the paradigm of 
nonlocal and nonsignaling correlations (see [9]). With 
this definition, we can express any Piso a-s the following 
mixture 



Pis 



CPp 



(1 - C)P^P^, 



(12) 



Nonsignaling correlations 




Quantum Mechanics 






Local models 
\ 1 




► 



2 



1 



c 



FIG. 1: Value of C for isotropic correlations (12). 



where P^ is the local noise distribution for Alice, in- 
dependently of the inputs. Thus, one can interpret C 
as the probability of sharing a PR-box instead of local 
noise. 



III. MONOGAMY OF NONLOCAL 
CORRELATIONS 

While classical correlations can be shared among 
an indefinite number of parties, it is well known that 
quantum correlations cannot. This fact is often called 
monogamy of entanglement [15]. In this section we 
prove that this is a generic feature of all non-signaling 
theories. 

First, let us recall a result already mentioned in [4]. 
All Bell inequalities for which the maximal violation 
consistent with no-signaling is attained by a unique dis- 
tribution, have monogamy constraints. Suppose that 
S is a Bell inequality with unique maximal violator 
Pmax- If Alice-Bob maximally violate this inequality 
B\P{a,b\x,y)] = 1, then, Alice and Clare are com- 
pletely uncorrelated. To prove this, first notice that 
because all Bell inequalities B[P] are linear in P, Pmax 
must be an extreme of the Alice and Bob polytope. 
Otherwise, the maximal violator would not be unique. 
Second, using the definition of marginal distribution 
and the no-signaling condition we have 

PMAx(a, 6|a;,y) = ^P(a,5, c|x,y, z) 

C 

= ^P{a,b\x,y,z,c)P{c\x,y,z) 

C 

= Y,P{a,b\x,y,z,c)P{c\z), (13) 

c 

for all z. But, because PM_Ay.(fl,b\x,y) is extremal, any 
such decompositions must consist of only one term. 
This implies that Clare is uncorrelated with Alice and 
Bob. 

Actually, one can prove that all the CGLMP inequal- 
ities have a unique nonsignaling probability distribu- 
tion achieving its algebraic maximum. This well-known 
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set of inequalities was first proposed in [16] for the case 
of two inputs of d possible outputs. One can easily see 
that imposing no-signaling and maximal violation of 
CGLMP inequality identifies a unique probability dis- 
tribution P{a, b\x, y). This means that this set of Bell 
inequalities have the previous monogamy condition. 



with each distribution P{hi\yi.e) being equal to the 
P{b\y, e) that appears in (5). We can merge the previ- 
ous two statements in the following one: 

Result 2: locality and oo -shareability are equivalent 
properties. 



A. m-shcireability and locality 

Shareability represents a natural property in the 
analysis of the monogamy of correlations. A bipar- 
tite probability distribution P{a,b\x,y) is said to be 
m-shareable with respect to Bob, if there exists an 
(m-l-l)-partite distribution P{a, bi, . . . b.^\x, yi, . . . ym) 
being symmetric with respect to {bi,yi) ■ ■ ■ {bm,ym), 
with marginals P{a,bi\x,yi) equal to the original dis- 
tribution P(a,h\x,y). The following result shows the 
relation between shareability and nonlocality. 

Result 1: If P{a,b\x,y) is m-shareable with respect 
to Bob, then, it satisfies all Bell inequalities with m (or 
less) different values for the input y. 

Proof: To prove this statement, we construct a 
local model for P{a,b\x,y) when y constrained to 
y = 1 , . . . m [without loss of generality] . By as- 
sumption P{a,bi, . . .bm\x,yi, . ■ .ym) exists, then so 
P{bi,...bm\yi,...ym) and P{a\x,bi, . . .bm,yi, . ■ .ym) 
do. In this local model, the information shared by the 
parties is the string . . . bm), when the correspond- 
ing inputs are fixed to yi = 1, . . . = m. Thus, using 
the definition of conditional probabilities, we can de- 
compose P{a,b\x,y) in the following way 

P{a,b\x,y)= Pibu...bm\-i-,...m) (14) 

bi,...bm 

xP{a\x, 6i, . . . 6m, 1, . . . m)5b,by ■ 

Where the three factors in each term of the sum have 
to be interpreted as the P{e), P{a\x,e) and P{b\y,e) 
appearing in the decomposition (5), respectively. 

Note that this result represents the extension of The- 
orem 2 in [10], derived for quantum states, to the more 
general nonlocal scenario. It also implies that if a state 
is X/F-shareable with respect to Alice/Bob, then it is 
local. In particular, two-shareable states do not violate 
the CHSH nor the CGMLP inequalities. 

A converse of the previous result is also true: if a 
state is local, then it is oo -shareable with respect to any 
party. To show the last statement, we explicitly con- 
struct the extension [to m Bobs] for the arbitrary local 
correlations written in (5): 

P{a,bi,...bm\x,yi,...ym) (15) 
= X] -P(e)-P(a|a;, e)P{bi\yi,e) ■ ■ ■ P{bm\ym, e), 



This result is analogous to what happens in QM: a 
bipartite quantum state is oo-shareable if and only if 
it is separable [14]. 



B. Examples 

In what follows, we show that the CHSH inequality 
presents an even stronger kind of monogamy. 

Result 3: Consider a binary input/output tripar- 
tite distribution P{a,b,c\x,y, z). If Alice and Bob's 
marginal is nonlocal, then Alice and Clare's marginal 
must be local. 

BcHSH[P{a,b\x,y)] > BcHSH[P{a,c\x, z)] < 



(16) 



Proof. We prove this statement by contradic- 
tion. Suppose that there exists a tripartite distribu- 
tion P(o, b, c\x, y, z) such that both P{a, b\x, y) and 
P(a, c\x, z) arc nonlocal. Then Alice-Bob, and simulta- 
neously Alice-Clare, can depolarize their bipartite cor- 
relations and transform them into isotropic ones, with- 
out decreasing the Bell violation. This procedure is 
shown in Appendix B. Then, if Alice-Bob have larger C 
than Alice-Clare, Bob decreases it until both are equal 
(this procedure is explained in Appendix A). An anal- 
ogous thing is done in the opposite situation. After 
this manipulations, both marginals are isotropic and 
have the same value of C. This implies that the two 
marginals are equal, and thus two-shareable. In sec- 
tion 3.3 we have seen that, a two-shareable state can- 
not violate CHSH. This finishes the construction of the 
contradiction. 

In more general situations strict monogamy no 
longer holds. Indeed, one can easily design a situa- 
tion where Alice shares a PR-box with Bob, and an- 
other with Clare. This corresponds to a case where 
Alice can choose between 4 inputs of 4 outputs, while 
Bob and Clare are restricted to the simplest case of 
Y = Z = B = C = 2. Clearly, the corresponding Alice- 
Bob and Alice-Clare distribution violate the CHSH in- 
equality. A nicer and more symmetric example, with 
only two inputs for each party, is given by the following 
tripartite distribution 



jABC 



1 pAB pC 
2^pr{0,1}^n{0,1} 



1 



-Pi4't2,3}-Ppf{2,3}' (!''') 
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where Prn{a,f3} is a PR-box with outputs restricted to 
a,b € {a,/3}, -Pn{q./3} is a local noise distribution with 
outputs restricted to a, b e {a, j3}, and the superindiccs 
label the parties. In what follows, we prove that the 
Alice-Bob marginal 

pAB _ ^ pAB j_^pA pB /io\ 

^ — 2 pr{o,i} 2 '^{^'^^ ^ ' 

is nonlocal. Assume the opposite: P^^ can be ex- 
pressed as a mixture of local extreme points (5). Be- 
cause each local extreme point has determined out- 
comes, we can split the local mixture into a part with 
outcomes {2, 3}, and a part with outcomes {0, 1}. The 
last, would correspond to a local expansion of P^J^q ^y, 
but we know that such thing does not exist. Now, using 
the symmetry of (17), we conclude that its marginals 
P^^ and P^*^ are both nonlocal. 

In the case X = Y = 2 and A, B arbitrary, there is 
a situation where strong monogamy still holds: where 
the reduced states of Alice-Bob and Alice-Clare, consist 
both on isotropic correlations with non-uniform noise 
[independent of the inputs]. First, let us generalize 
the idea of isotropic distributions for arbitrary output 
alphabets. The generalization of the PR-box is [4] 

T> I u \ f IM if a-hm.odA = xy 
P,^{a,h\x,y) = ^^ otherwise " ^^^^ 

In a natural way, we define 

p^i^ = CPA"" + (1 - c)p,iuP^L, (20) 

where Pj^^ is an arbitrary local distribution for Al- 
ice, independent of the inputs. It is clear that if Alice 
and Bob add to their outputs a shared random number 
modulo A: 

a^a + r mod A (21) 
b^b + r mod A, (22) 

their distribution becomes: 

P.io - CP^^ + (1 - C)P^P^, (23) 

where Pn is the (local) uniform distribution inde- 
pendent of the inputs x/y. As in the case A = B = 2, 
if C is positive, one of the parties can decrease its value 
by performing a local operation. Using the same trick 
as before, one can prove that all tripartite distribu- 
tions where the marginals Alice-Bob and Alice-Clare 
are both isotropic with non- uniform noise (20), show 
strong monogamy. 

IV. NO-CLONING 

The Quantum No-Cloning Theorem represents one 
of the cornerstones of Quantum Information Theory. It 



is usually explained as a consequence of the nonorthog- 
onality of quantum states and the linearity of quantum 
time evolution. The relation between quantum cloning 
and no-signaling has also been studied by several au- 
thors. Indeed, if one assumes that (i) states are de- 
scribed by vectors in Hilbert spaces, (ii) probabilities 
are obtained according to the usual trace rule, and (iii) 
no-signaling, the optimal fidelity of a cloning machine 
cannot be larger than the one allowed by quantum dy- 
namics [11]. In what follows, we formulate the problem 
independently of QM and show that 

Result 4: All nonsignaling theories predicting the 
violation Bell inequalities have a no- cloning theorem. 

A similar result was proved for the case of the CHSH 
inequality by R. F. Werner, in [12]. Here we prove it 
for general nonlocal theories, not necessarily violating 
the CHSH inequality. Suppose that there exists a ma- 
chine to which we can input a physical system [in an 
arbitrary state], and it outputs two systems in exactly 
the same state as the original one. We call such engine 
perfect cloning machine. Let us consider the following 
situation: Alice and Bob share the nonlocal distribu- 
tion P(a, b\x, y), and perform the following two space- 
like separated events. On one site, Alice chooses the 
input Xq and obtains the output oq. On the other site. 
Bob performs m clones of its original system. For an 
observer who sec first the event on Alice's site, the de- 
scription of Bob's input system is P(6|j/, xq, ag). For 
this observer, Bob's system is completely uncorrelated 
with the rest of the universe, and the functioning of 
the perfect cloning machine is unambiguous: 

P{b\y,xo,ao) P{bi, . . .bm\yi, ■ ■ ■ym,xo,ao) (24) 

Obviously, the joint sate of all clones 
P{bi, . . .bmlvi, ■ ■ ■ym,XQ,aQ) is such that when 
we trace all but one, P{bi\yi,xo,ao), this distribution 
is the same as the original one, P{b\y, xq, oq)- Because 
we consider a perfect cloning machine there is no 
distinction between pure and mixed states: all are 
perfectly cloned. For an observer who first sees Bob's 
operation, its description of the physical situation is 

P{a,bi,...bm\x,yi,...ym) ■ (25) 

But, because all descriptions must give consistent pre- 
dictions, the descriptions from the point of view of the 
two mentioned observers (24) and (25) must be the 
same, up to conditioning on a. This implies that the 
original distribution P{a,b\x,y) is m-sharcable. More 
concretely, because m is arbitrary, we can say that 
P{a,b\x,y) is oo-shareable. According to the result of 
section 3.1, the original distribution P{a,b\x,y) must 
be local, in contradiction with the initial assumption. 
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A. Phcise covariant cloning machine 

Once we have ruled out the existence of a perfect 
cloning machine, it is interesting to look for the optimal 
imperfect one. Suppose that its action is 

P{a,b\x,y) — > P(a,bi,b2\x,yi,y2), (26) 

where, without loss of generality we can assume that 
the final distribution is symmetric with respect to 
(&i,yi) and (62,2/2)- By definition, the reduced dis- 
tribution P(a,bi\x,yi) is two-shareable. This implies 
that it cannot violate any two-input Bell inequality. 
In particular, if the initial distribution P(a,b\x,y) has 
Y = 2, the resulting clones are correlated with Alice's 
system in a local way. 

Let us consider a particular case in the binary in- 
put/output scenario. Consider that Alice and Bob 
share an isotropic distribution with parameter C. Bob 
clones his subsystem, and, according to the previous 
paragraph, the resulting clones are locally correlated 
with Alice's subsystem. If we suppose that the clones 
are isotropically correlated with Alice, the maximum 
value for their parameter is Ccln = 1/2. Thus, the 
shrinking factor associated to this cloning operation is 



C ~ 2C- 

Now, consider the isotropic correlations that arise when 
measuring a singlet with the observables that maximize 
the CHSH violation, that with C = I/V2. In 

this case, the shrinking factor (27) coincides with the 
one of the phase covariant quantum, cloning m,achine 
I/V2 [17], that is QM attains this maximum value for 
the cloning of nonlocal correlations. In this sense, QM 
clones the quantum correlations achieving the Cirelson 
bound in an optimal way. 

V. NON-LOCALITY AND PRIVACY 

The monogamy of correlations and the impossibility 
of perfect cloning seem immediately to be related to the 
concept of privacy. If two honest parties know to share 
correlations with some degree of monogamy, they can 
estimate and possibly bound their correlations with a 
third dishonest party, the eavesdropper. In this section 
we strengthen this intuitive idea, proving that under 
the no-signaling assumption, a probability distribution 
contains secrecy if and only if it is nonlocal. Recall that 
this does not mean that this probability distribution 
can be transformed into a secret key. 

For the sake of simplicity we consider the bipartite 
case. In a cryptographic scenario, one usually consid- 
ers two honest parties (Alice and Bob) each possess- 
ing a random variable A and iJ, and an eavesdropper 
(Eve) having E. The correlations among the three ran- 
dom variables are described by a probability distribu- 
tion Pab e ■ On the other hand, it is meant by nonlocal 



correlations those probability distributions conditioned 
on some inputs P{a,b\x^y) that cannot be written in 
the form of Eq. (5). It is in principle not so evident how 
to relate the two scenarios. For instance, how to add 
(i) the third party in the nonlocal scenario or (ii) the 
missing inputs for Alice and Bob in the cryptographic 
scenario. Therefore, before proving the equivalence be- 
tween privacy and nonlocality one has to connect the 
two considered scenarios. 



A. Secret correlations 

A tripartite probability distribution [without inputs] 

Pabe among two honest parties and an eavesdropper 
contains secrecy when it cannot be generated by local 
operations and public communication (LOPC), i.e. its 
formation requires the use of a private channel or secret 
bits [18]. On the other hand, Pabe can be generated 
by LOPC, if there exists a stochastic map E ^ E' such 
that 

Pab\e' = Pa\e'Pb\e'- (28) 

We say that Pab e contains secrecy [18] when this is not 
possible. We stress that this does not mean that many 
copies of Pab e can later be used to obtain a secret key 
by LOPC. Indeed, there are probability distributions 
with positive secrecy content, which cannot be distilled 
into a secret key by LOPC [19]. 

Now, suppose Alice and Bob share a distribution 
P{a,b\x,y). They decide the inputs according to uni- 
form distributions: p{x) = 1/X and p{y) = l/F [20]. 
Then, Alice's and Bob's information is respectively 
A = {a,x) and B = {b,y). The random variables A 
and B are correlated according to 

PAB = P{a,b\x,y)^. (29) 

Can Alice and Bob bound Eve's information on their 
outcomes from their observed correlations? Can one 
prove that all possible extension Pabe of Pab, de- 
rived from P{a,b\x,y) through Eq. (29), contain se- 
crecy? This is of course impossible if no assumption 
on the possible extensions are made. In general, Alice 
and Bob can never exclude that Eve has a perfect copy 
of their outcomes, unless some constraints are imposed. 
However, if it is assumed that no faster-than-light com- 
munication is possible, not all possible extension of the 
initial bipartite probability distribution are allowed. 
Let us only consider extensions -P(a, 6, e\x, y) compati- 
ble with no-signaling. Thus, to each P{a, b\x, y) we can 
associate a family of tripartite distributions 

Pabe = P{a,b,e\x,y)^y, (30) 

where E = e. We say that P{a, b\x, y) contains secrecy 
if all its associated Pabe contain secrecy. 
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B. All nonlocal correlations contain secrecy 

The aim of this section is to show the Hnk be- 
tween the nonlocal properties of P{a, b\x, y) and the se- 
crecy content of any possible extension Pabe, defined 
through (30). Before proceeding, note that an equiva- 
lent way of defining local correlations is as follows: a 
probability distribution P{a,b\x,y) is local (5) when 
there exists a [nonsignaling] extension P{a,b,e\x,y) 
such that 

P{a, b\x, y, e) = P(a|x, e)P{b\y, e). (31) 

Now, assume one has a bipartite distribution 
P{a,b\x,y) for which there exists an extension Pabe 
with no secrecy content, that is 

Pab\e = Pa\ePb\e- (32) 

Because processing the outcomes of a nonsignaling dis- 
tribution gives another nonsignaling distribution, any 
transformation E ^ E' is included in the arbitrariness 
of the extension P{a, 5, ejx, y). By using the definition 
of conditional probabilities, one can see that (32) is 
equivalent to (31). That is, Pabe has no secrecy if 
and only if there exists an extension of P(a, b\x, y) sat- 
isfying (31), which is to say that P{a,b\x,y) is local. 
This establishes the following equivalence. 

Result 5: A distribution contains secrecy if and 
only if it is nonlocal. 

It was already proven in [6] , that all local correlations 
(5) can be distributed by LOPC. The public message 
that one of the parties, say Alice, should send to the 
rest in order to create the correlations, is precisely the 
(hidden) variable e that appears in (5). Therefore, if 
Alice and Bob's probability distribution is local, they 
cannot exclude that the global probability distribution 
including Eve does not contain any secrecy. 

The following natural question is to identify those 
nonlocal correlations distillable to a secret key and 
whether they can be distributed using quantum states 
[22]. This will define those quantum correlations se- 
cure against an eavesdropper only limited by the no- 
signaling principle [6]. 



VI. NONLOCALITY AND RANDOMNESS 

We first start by showing that all nonlocal correla- 
tions have random outcomes (see also [3]). Consider 
a deterministic bipartite distribution Pj^ET.{a,b\x,y). 
That is, a and b arc deterministic functions of [x.y). 
Using this and no-signaling, we can get the following 



equalities 

PDET(a, 6|a;,y) = 5(^a,b),{f[x,y\,g[x,y]) 

= P{a\x,y)P{b\x,y) 

= Pia\x)P{b\y). (33) 

The last line is a distribution of the form (5). There- 
fore, all deterministic distributions are local. Or in 
other words, all nonlocal states have uncertain out- 
comes. This fact can be straightforwardly extended 
to the n-party case. Thus, there are two kinds of ran- 
domness in any nonsignaling theory with nonlocal cor- 
relations. The first one reflects our ignorance and cor- 
responds to those probability distributions that can be 
written as the convex combination of extreme points. 
But, like in QM, there is also an intrinsic randomness 
even for extreme points, or pure states. The PR-box 
(11) is an example of a pure state with uncertain out- 
comes. 



A. Incompatible observables and uncertainty 

Finally, within QM it is said that two observables 
(Ooj Oi) are compatible if there exists a more complete 
one O of which both are functions: (Oo,0i) = f{0). 
Consider P{a,b\x,y), we say that the two observables 
in Bob's site 6o and bi [corresponding to the inputs y = 
0, 1] are compatible, if there exists a joint distribution 
for both P'{a, bo, bi\x). That is 

Y,P'ia,bo,bi\x) = P{a,bi\x,y=l) , (34) 

bo 

J2P'{a,bo,bi\x) = P{a,bo\x,y = 0) . (35) 

bi 

Or in other words, P{a,b\x,y) is two-shareable with 
respect to Bob if we restrict to ?/ = 0, 1. 

When the observables (60,^1) are not compatible, a 
possible way of quantifying the degree of incompatibil- 
ity is 

inc[6o, ''i] = minjij > : P{a, b\x, y) (36) 
= r]Pi^c{a,b\x,y) ■ti)PcoM{a,b\x,y)], 

where Pcom((i, 6|a;, y) is a distribution where 60 and 

hi are compatible, and, Pl^cXo■^^^^y) is an arbitrary 
one. It is clear that the range of inc[feo,&i] is [0,1], 
and inc[&05 ^1] = if and only if &o and 61 are com- 
patible. In Appendix B it is proven that in the binary 
input/output case, this minimization yields the CHSH 
violation: 

inc[6o, hi] = B CHSH [P{aMx,y)]. (37) 

In the case of binary outputs or inputs, we are able 
to establish a direct relation between inc[6oj &i] and the 
uncertainty of 60 and 61: 
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Result 6: In the binary output case [A = B = 2] 
the following constraints hold: 

H{bo) > /iQinc[6o,6i]) , (38) 

H{h) > h(^^mc[bo,bi]y (39) 

where H(b) is the entropy of the output b, and h{x) is 
the binary entropy of x [13]. These inequalities also 
hold in the binary input case [X = Y = 2], and are 
still tight. 

The proof of this result is in Appendix B. Although 

this has the flavor of the Hciscnbcrg uncertainty rela- 
tions, it diS'ers in the fact that here we do not have a 
trade off between the uncertainty of each observable. In 
particular, if 6o is deterministic, inequality (38) implies 
inc[6o,fci] = 0, and hence, nothing prevents bi from 
being deterministic too. It is also remarkable that, a 
deterministic observable is compatible with any other. 

Notice that in some of the proofs in this article, 
we express distributions in terms of nonlocal extreme 
points. But, some nonsignaling theories may not in- 
clude them, like for example, QM does not include PR 
correlations (11). It is important to stress that this 
is not an inconvenient for the validity of the proofs 
when applied to any particular theory. For instance, 
although QM does not predict PR correlations we can 
always write some quantum mechanical correlations as 
a mixture of PR and local ones. 



VII. CONCLUSIONS 

In this work, we have identified a series of features 
common to all physical theories that do not allow for 
instantaneous transmission of information, and predict 
the violation of Bell inequalities. As shown, these two 
assumptions are sufficient to prove: 

• Constraints on how nonlocality is distributed 
among the correlations of different pairs of parti- 
cles in multipartite scenarios. 

• Impossibility of perfect cloning of states. 

• Strict equivalence of the following properties: 



1. nonlocality 

2. bounded shareability 

3. positive secrecy content 

• A relation for the incompatibility of two observ- 
ables and the uncertainty of their outcomes. 

Hence, some properties traditionally attributed to QM 
are generic within this family of physical theories. For 
example: the fact that two observables cannot be si- 
multaneously measured on the same system (incompat- 
ibility), becomes necessary to explain the correlations 
observed in some experiments [violation of CHSH [2]] , 
independently of the fact that we use models based on 
noncommuting operators to explain such experiments 
(see also [12]). Moreover, a no-cloning theorem can 
be derived without invoking any nonorthogonality of 
states of linearity of the evolution. 

This indicates how constraining is the demand that a 
theory compatible with special relativity predicts the 
violation of Bell inequalities. One could actually say 
that there is not much room left out of QM. 

From a more fundamental point of view, this work 
proposes a different approach to the study of quantum 
properties. In general, QM has been studied in com- 
parison with Classical Mechanics, that is, starting from 
a more restrictive theory. Here, the idea is to start from 
a more general family of theory, and to study "quan- 
tum" properties common to all them. It is then an 
open research project to identify those additional pos- 
tulates that allow one to recover the whole quantum 
structure. 
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Appendix A. Depolarization and shrinking 

In this appendix it is shown that, in the case X = 
Y = A = B = 2, any distribution can be transformed 
into an isotropic one maintaining the CHSH violation 
(8) invariant. We call this process depolarization. We 
also show that the parameter C of an isotropic distri- 
bution can be decreased with local operations. We call 
this operation shrinking. 

Depolarization: this transformation can be imple- 
mented by using 3 bits of shared randomness and local 

operations, in the following two steps: 

First step, Alice and Bob perform with probability 
1/2 one of the following two operations: 

1. Nothing 

2. FUp a and b 

This makes the correlations locally unbiased. 

Second step, with probability 1/4 both parties per- 
form one of the following four operations: 

1. Nothing 



2. Flip ax=i and y 

3. Flip X and by^i 

4. Flip X, ax=o, y and bb=i 

where flipping a^^i means that a is only flipped when 
X = I, that is a a + X mod 2. After the second 
step, the resulting correlations satisfy (9). It can be 
seen that both steps keep invariant the violation of the 
CHSH inequality. 

Shrinking: a useful observation is that when C > 0, 
the value of Schsh can always be decreased by perform- 
ing an operation in one site. This is accomplished when 
one party, say Bob, outputs b with probability 1 — e, and 
an unbiased random bit with probability e. This oper- 
ation implements the transformation: C — > (1 — e)C. 

Appendix B. Proofs of section 6 

Result: In the case A = B = X = Y = 2 the degree 
of incompatibility of two observables is 

inc[bo,bi] = Bchsh[P] . (42) 

Proof. The minimization in the definition of 
inc[6o, bi] in (36), is completely equivalent to the mini- 
mization of in the optimal eavesdropping extension 
(Appendix B). Then, we just have to substitute /x by 
Pnl which gives the equality (42). 

Result 6: In the binary output case [A = B = 2] 
the following constraints hold: 

H{bo) > /iQinc[6o,6i]) , (43) 

H{bi) > /iQinc[6o,6i]) , (44) 

where II{b) is the entropy of the output b, and h{x) 
is the binary entropy of x [13]. This inequalities also 
hold in the binary input case [X = Y = 2], and are 
still tight. 

Proof. Let us prove the above inequalities (43,44) 

for the binary output case. It is shown in this case [9] 
that, for all extreme points, the one party marginals are 
deterministic or unbiased: [P{b = 0\y), P{b = l\y)] G 
{[0, 1], [1,0], [1/2, 1/2]}. In the next we see that, if one 
observable, say y = 0, is deterministic [P(6o|0) = 0, 1] 
then it is compatible with all the rest. To see this 
suppose that the outcome of bo is always bo = /3, 
then, for any y, the joint distribution P{a, bo, by\x, y) = 
P{a,by\x,y) Shg^p exists. Then, bo and by are compat- 
ible by definition. Now, let us decompose 

PiNc as a 

mixture of extreme points. This mixture must not 
contain extreme points having the marginal of bo or 
the marginal of bi deterministic. Otherwise, one could 
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move this extreme point to the mixture of compati- 
ble ones -PcoMj decreasing the vahie of rj. Thus, the 
marginals for bo and bi taken from Pinc are always un- 
biased. Therefore, inc[6o,&i] is the probability of get- 
ting with certainty an unbiased outcome. The situation 
where 60 and hi have minimal entropy is when Pcom 
is deterministic. Suppose that Pcom(& = 0\y = 0) = 1, 
then recalling (36) 

P{b = l\y = 0) = inc[6o,6i]PiNc(6 = l|2/ = 0) 

= hnc[bo,bi], (45) 

and thus the entropy of bo is H{bo) = /i(inc[6o, fei]/2). 
The same holds for 61. In general, when Pcom is not de- 
terministic, the entropies will be larger than the bounds 
(43,44). 

Let us prove that the bounds (43,44) also hold in the 



case where inputs are binary, and the outputs belong 
to larger alphabets. In that case, all extreme points 
have been classified in [4]. There, it is shown that, 
all extreme points have local marginals where all out- 
comes with non-zero probability are equiprobable. As 
discussed before, if we write Pmc as a mixture of ex- 
treme points, the marginals for 60 and bi given by these 
extreme points must have at least two outcomes with 
nonzero probability. Otherwise the two observables are 
compatible and we can attach the extreme point to 
Pino decreasing r]. The situation where 60 and bi have 
minimal entropy is when Pcom is deterministic, and 
PiNc has only two outcomes with nonzero probability 
for 60 and 61. In such case, the inequalities (43,44) are 
saturated. When Pinc has more than two outcomes 
with nonzero probability for 60 and 61, the entropies 
will be larger. 



